An Index-Based Method for Timestamped Event Sequence Matching
نویسندگان
چکیده
This paper addresses the problem of timestamped event sequence matching, a new type of sequence matching that retrieves the occurrences of interesting patterns from a timestamped event sequence. Timestamped event sequence matching is useful for discovering temporal causal relationships among timestamped events. In this paper, we first point out the shortcomings of prior approaches to this problem and then propose a novel method that employs an R∗-tree to overcome them. To build an R∗-tree, it places a time window at every position of a timestamped event sequence and represents each window as an n-dimensional rectangle by considering the first and last occurrence times of each event type. Here, n is the total number of disparate event types that may occur in a target application. When n is large, we apply a grouping technique to reduce the dimensionality of an R∗-tree. To retrieve the occurrences of a query pattern from a timestamped event sequence, the proposed method first identifies a small number of candidates by searching an R∗tree and then picks out true answers from them. We prove its robustness formally, and also show its effectiveness via extensive experiments.
منابع مشابه
A multi-dimensional indexing approach for timestamped event sequence matching
This paper addresses the problem of timestamped event sequence matching, a new type of similar sequence matching that retrieves the occurrences of interesting patterns from timestamped sequence databases. The sequential-scan-based method, the trie-based method, and the method based on the iso-depth index are well-known approaches to this problem. In this paper, we point out their shortcomings, ...
متن کاملQuerying Timestamped Event Sequences by Exact Search or Similarity-based Search: Design and Empirical Evaluation
Specifying timestamped event sequence queries is challenging even for skilled computer professionals familiar with SQL. Most graphical user interfaces for database search use a exact search approach, which is often effective, but applies an exact match criteria. We describe a new similarity-based search interface, in which users specify a query by simply placing events on a blank timeline and r...
متن کاملMeasurement of Left Ventricular Myocardium Wall Instantaneous Motions with Echocardiographic Sequence Images
Background & Aims: One of the important aims of quantitative cardiac image processing is the clarification of myocardial motions in order to derive biomechanical behavior of the heart in the disease condition. In this study we presented a computerized analysis method for detecting the instantaneous myocardial changes by using 2D echocardiography images. Methods: The analysis was performed on th...
متن کاملAn edit operation-based approach to approximate string matching in large DNA databases
In DNA related research, due to various environment conditions, mutations occur very often, where a mutation is defined as a heritable change in the DNA sequence. Therefore, approximate string matching is applied to answer those queries which find mutations. The problem of approximate string matching is that given a user specified parameter, k, we want to find where the substrings, which could ...
متن کاملHow to improve efficiency of analysis of sequential data?
Many of todays database applications, including market basket analysis, web log analysis, DNA and protein sequence analysis utilize databases to store and retrieve sequential data. Commercial database management systems allow to store sequential data, but they do not support efficient querying of such data. To increase the efficiency of analysis of sequential data new index structures need to b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005